Querying Uncertain Data in Heterogeneous Databases

نویسندگان

  • Pauray S. M. Tsai
  • Arbee L. P. Chen
چکیده

In heterogeneous databases the user may issue a query to join two relations in di erent databases on the keys However the keys may be incompatible In this paper we extend our results on probabilis tic query processing to consider joining two relations on incompatible keys A new approach to identify the same entities in di erent relations is proposed Various data and schema con icts such as missing data inconsistent data and domain mismatch are considered in the identi cation process Probabilis tic techniques are used to estimate the sameness of two entities to process queries and to estimate the degree of uncertainty for the query results

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Extending dynamic queries to handle uncertain data

Dynamic querying is a technique which has been used successfully to enable novice users to gain access to and insight into data in databases. Some multimedia archives (such as archives of African art) contain data which have vague locations in time and space, that is, although there is some idea of when and where the entity originated, the precise information is unknown. This uncertainty create...

متن کامل

Querying Heterogeneous Databases Using Standardized Schemas and SQL

Making databases available for querying both within and across organizations is a vision held by many. Realizing this vision, however, is usually hampered by the existence of heterogeneous database systems, heterogeneous query languages and heterogeneous data semantics. What is needed is a uniform method for accessing these databases. This paper presents a standards based approach in which SQL ...

متن کامل

Querying Heterogeneous Mediated Sources: A Survey

Data integration systems allow access to information in increasingly different forms: relational databases, spreadsheets, web pages, and so on. Querying such heterogeneous sources is challenging due to non-uniform query capability of sources, variety of schema and data models, and limitations on access paths. Most systems use some form of mediation to allow access to heterogeneous sources. Some...

متن کامل

Querying Nested Historical Relations in Heterogeneous Databases Environment

We study schema integration problems for consolidating historical information from nested relational databases in heterogeneous databases environment. These nested relations are for supporting complex objects. In heterogeneous databases systems, probabilistic partial values have been used to resolve some schema integration problems. In this paper, we extend the concept of probabilistic partial ...

متن کامل

Indexing the Earth Mover's Distance Using Normal Distributions

Querying uncertain data sets (represented as probability distributions) presents many challenges due to the large amount of data involved and the difficulties comparing uncertainty between distributions. The Earth Mover’s Distance (EMD) has increasingly been employed to compare uncertain data due to its ability to effectively capture the differences between two distributions. Computing the EMD ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1993